Abstract

We prove a quantitative version of the Gibbard-Satterthwaite theorem. We show that
a uniformly chosen voter profile for a neutral social choice function $f$ of $q \ge 4$ alternatives
and $n$ voters will be manipulable with probability at least $10^{-4}\epsilon^2 n^{-3} q^{-30}$, where $\epsilon$ is the
minimal statistical distance between $f$ and the family of dictator functions.

Our results extend those of [FKN09], which were obtained for the case of 3 alternatives,
and imply that the approach of masking manipulations behind computational
hardness (as considered in [BO91, CS03, EL05, PR06, CS06]) cannot hide manipulations
completely.

Our proof is geometric. More specifically, it extends the method of canonical paths to
show that the measure of the profiles that lie on the interface of 3 or more outcomes is
large. To the best of our knowledge, our result is the first isoperimetric result to establish
an interface of more than two bodies.
1 Introduction

Social choice theory studies methods of collective decision making, and their interplay with 
social welfare and individual preference and behavior. Rigorous study of social choice dates 
back to the 18th century, when Condorcet discovered the following voting paradox: in a social
ranking of three alternatives that is determined by the majority vote, an 'irrational' circular 
ranking may occur where a candidate A is preferred over a candidate B, B is preferred over 
C, and C is preferred over A. Social choice theory in its modern form was established in 
the 1950's with the discovery of Arrow's impossibility theorem [Arr50, Arr63], which showed 
that all social ranking systems that satisfy a few reasonable conditions must either obtain 
irrational circular outcomes, or be dictatorships (a dictatorship is a system where the ranking 
is determined by just one voter). 

Manipulations. Many of the results in the study of social choice are negative, showing that 
certain desired properties of social choice schemes cannot be attained. One of the hallmark 
examples of such theorems was proved by Gibbard and Satterthwaite [Gib73, Sat75]. Their
theorem considers a voting system where each of $n$ voters ranks $q$ alternatives, and the winner
is determined according to some pre-defined social choice function $f\colon L_q^n \to [q]$ of all the
voters' rankings (here $L_q$ denotes the set of total orderings of the $q$ alternatives).

We say that a social choice function is manipulable, if a situation may occur where a voter 
who knows the rankings given by other voters can change her own ranking in a way that does 
not reflect her true preferences, but which leads to an outcome that is more desirable to her. 
Formally:

Definition 1.1 (Manipulation point). For a ranking $x \in L_q$, write $a \overset{x}{>} b$ to denote that the
alternative $a$ is preferred by $x$ over $b$. A social choice function $f\colon L_q^n \to [q]$ is manipulable at
$x \in L_q^n$ if there exist a $y \in L_q^n$ and $i \in [n]$ such that $x$ and $y$ only differ in the $i$'th coordinate
and

$$ f(y) \overset{x_i}{>} f(x). \tag{1} $$

In this case we also say that $x$ is a manipulation point of $f$, and that $(x,y)$ is a manipulation
pair for $f$. We say that $f$ is manipulable if it is manipulable at some point $x$. We also say
that $x$ is an $r$-manipulation point of $f$ if $f$ has a manipulation pair $(x, y)$ such that $y$ is
obtained from $x$ by permuting (at most) $r$ adjacent alternatives in one of the coordinates of
$x$.
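
For very small $n$ and $q$, the definition of a manipulation point can be checked by exhaustive search. The Python sketch below is illustrative only: the encoding of a ranking as a tuple (most preferred alternative first, alternatives numbered from 0) and the names `prefers`, `is_manipulation_point`, `plurality` and `dictator` are assumptions made here, not notation from the paper.

```python
from itertools import permutations

def prefers(ranking, a, b):
    """True if alternative a is ranked strictly above b in `ranking`."""
    return ranking.index(a) < ranking.index(b)

def is_manipulation_point(f, x, q):
    """Brute-force test: can some voter i obtain an outcome she
    strictly prefers (under her true ranking x[i]) by misreporting?"""
    truthful = f(x)
    for i in range(len(x)):
        for lie in permutations(range(q)):
            y = x[:i] + (lie,) + x[i + 1:]
            if prefers(x[i], f(y), truthful):
                return True
    return False

def plurality(x):
    """Most frequent top choice; ties go to the leftmost voter
    whose top choice is among the tied alternatives."""
    tops = [r[0] for r in x]
    best = max(tops.count(a) for a in tops)
    return next(a for a in tops if tops.count(a) == best)

def dictator(x):
    """The first voter's top choice always wins."""
    return x[0][0]
```

For example, in the three-voter profile `((0, 1, 2), (1, 2, 0), (2, 1, 0))` plurality elects alternative 0 by the tie-breaking rule, and the third voter gains by reporting alternative 1 on top, so the search reports a manipulation point.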

Gibbard and Satterthwaite proved that any social choice function which attains three or 
more values, and whose outcome does not depend on just one voter, must be manipulable. 

Theorem 1.2 (Gibbard-Satterthwaite [Gib73, Sat75]). Any social choice function $f\colon L_q^n \to [q]$
which takes at least three values and is not a dictator is manipulable.

The Gibbard-Satterthwaite theorem has contributed significantly to the realization that
it is unrealistic to expect truthfulness in the context of voting. In a way, this and other results
in social choice theory contributed to the development of mechanism design, a field centered
around developing social mechanisms that obtain desirable results even when each member
of the society acts selfishly.
Quantitative social choice. Theorem 1.2 is tight in the sense that monotone social choice
functions which are dictators or only have two possible outcomes are indeed non-manipulable 
(a function is non-monotone, and clearly manipulable, if for some set of rankings a voter 
can change the outcome from say a to b by moving a ahead of b in his preference). It is 
interesting, however, to study manipulation quantitatively, asking not just whether a function 
is manipulable but how many manipulations occur in it. To state results in quantitative social 
choice we need to define the distance between social choice functions. 

Definition 1.3 (Distance between social choice functions). The distance $D(f, g)$ between two
social choice functions $f, g\colon L_q^n \to [q]$ is defined as the fraction of inputs on which they differ:
$D(f, g) = P[f(X) \ne g(X)]$, where $X \in L_q^n$ is uniformly selected. For a class $G$ of social
choice functions, we write $D(f, G) = \min_{g \in G} D(f, g)$.

We also define some classes of functions that may not have any manipulation points. 

Definition 1.4. We use the following three classes of functions, defined for parameters n 
and q that remain implicit (when used, the parameters will be obvious from the context): 

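$\mathrm{CONST} = \{ f\colon L_q^n \to [q] \mid f \text{ is constant} \}$
$\mathrm{DICT}_i = \{ f\colon L_q^n \to [q] \mid f \text{ only depends on the } i\text{:th coordinate} \}$, for $i \in [n]$
$\mathrm{DICT} = \bigcup_{i=1}^{n} \mathrm{DICT}_i$
$\mathrm{NONMANIP} = \{ f\colon L_q^n \to [q] \mid f \text{ is either a dictator or takes at most two values} \}$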
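
The distance to the class of dictators can be computed exactly for tiny instances. The sketch below is an illustration under assumptions of our own (rankings as tuples with the top choice first, the function name `distance_to_dictators`); it uses the observation that the closest function depending only on coordinate $i$ outputs, for each value of $x_i$, the most frequent outcome of $f$.

```python
from collections import Counter
from itertools import permutations, product

def distance_to_dictators(f, n, q):
    """Exact D(f, DICT) by enumerating all (q!)^n profiles (tiny n, q only)."""
    rankings = list(permutations(range(q)))
    total = len(rankings) ** n
    best = 1.0
    for i in range(n):
        # For each value of the i-th coordinate, tally the outcomes of f.
        counts = {r: Counter() for r in rankings}
        for x in product(rankings, repeat=n):
            counts[x[i]][f(x)] += 1
        # Best approximation by a function of x_i alone: majority outcome.
        agree = sum(max(c.values()) for c in counts.values())
        best = min(best, 1 - agree / total)
    return best
```

A dictator (and any constant function, which trivially depends on a single coordinate) has distance 0, while three-voter plurality on three alternatives has strictly positive distance.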
1.1 Our results 

Our results only apply to social choice functions which are neutral. A social choice function is 
neutral if it is invariant under changes made to the names of the alternatives (see Definition 2.1 
for a formal description). In our first main result we show the following lower bound on the 
number of manipulation points in a neutral social function: 

Theorem 1.5. Fix $q \ge 4$ and let $f\colon L_q^n \to [q]$ be a neutral social choice function with
$D(f, \mathrm{DICT}) \ge \epsilon$. Then,

$$ P(f \text{ is manipulable at } X) \ge \frac{\epsilon^2}{2 n^3 q^6 (q!)^2} \tag{2} $$

where $X \in L_q^n$ is selected uniformly.

Note that the result above directly implies the following: 

Corollary 1.6. Fix $q \ge 4$ and let $f\colon L_q^n \to [q]$ be a neutral social choice function with
$D(f, \mathrm{DICT}) \ge \epsilon$. Then,

$$ P((X,Y) \text{ is a manipulation pair for } f) \ge \frac{\epsilon^2}{2 n^4 q^6 (q!)^3}, $$

where $X \in L_q^n$ is selected uniformly, and $Y$ is obtained from $X$ by uniformly selecting a
coordinate $i \in \{1, \ldots, n\}$ and resetting the $i$'th coordinate to a random preference.
The result above has super exponential dependency on the number of alternatives q. A
more refined analysis yields the following theorem. 

Theorem 1.7 (main theorem). Fix $q \ge 4$ and let $f\colon L_q^n \to [q]$ be a neutral social choice
function with $D(f, \mathrm{DICT}) \ge \epsilon$. Then,

$$ P(f \text{ is manipulable at } X) \ge P(X \text{ is a 4-manipulation point of } f) \ge \frac{\epsilon^2}{10^4 n^3 q^{30}} \tag{3} $$

where $X \in L_q^n$ is uniformly selected.

A result similar to Theorem 1.7 was obtained for the case q = 3 in [FKN09], but the 
result of [FKN09] counted manipulation pairs rather than manipulation points. Translating 
the bound on the fraction of manipulation points in Theorem 1.7 directly to the case of pairs 
deteriorates the lower bound, inserting a factor of $q!$ in the denominator. However, using the
stronger bound on the fraction of 4-manipulation points, a direct corollary lower bounds the
fraction of manipulation pairs of a certain kind while keeping the polynomial dependency on
$q$.

Corollary 1.8 (manipulation pairs). Fix $q \ge 4$ and let $f\colon L_q^n \to [q]$ be a neutral social choice
function with $D(f, \mathrm{DICT}) \ge \epsilon$. Then,

$$ P((X,Y) \text{ is a manipulation pair for } f) \ge \frac{\epsilon^2}{10^4 n^4 q^{34}} \tag{4} $$

where $X \in L_q^n$ is uniformly selected, and $Y$ is obtained from $X$ by uniformly selecting a
coordinate $i \in \{1, \ldots, n\}$, then selecting 4 adjacent alternatives in $X_i$ and randomly permuting
them.

The case of large q, solved here, was left as the main open problem in [FKN09]. Their main 
motivation was that deriving quantitative versions of Gibbard-Satterthwaite theorems with 
polynomial dependency of q and n would indicate that from the computational complexity 
point of view it is easy on average to find manipulation points. This point is discussed in 
more detail in the related work subsection. 

Our lower bound for the number of manipulation points deteriorates polynomially with the 
number of voters, n, and the number q of alternatives. Some polynomial deterioration as 
a function of $n$ is necessary. This can be observed by considering the plurality function
$\mathrm{pl}\colon L_q^n \to [q]$, whose value is defined to be the candidate which is top ranked by the largest
number of voters (break ties by picking the candidate which is top ranked by the 'leftmost'
voter). It is easy to observe that a point where no ties are formed is not a manipulation point
of $\mathrm{pl}$, and that for any fixed $q$ the fraction of points that do contain ties is polynomially small
in $n$. As for the dependency on $q$, we do not know whether it is necessary.

1.2 History and related work 

The Gibbard-Satterthwaite theorem presented a difficulty in designing social choice functions, 
namely that of strategic voting. A line of research aimed at overcoming these difficulties suggested
constructions of social choice functions where it is computationally difficult for a voter
to find a beneficial manipulation [BTT89, BO91, CS03, EL05]. However, these constructions
considered worst case analysis — they did not rule out the possibility that on average, finding 
a manipulation may be easy. Indeed, some results showed that finding manipulations is easy 
on average for certain restricted classes of social choice functions [PR06, CS06, Kel93] (see 
also the survey [FP10]). 

Recently, a result of Friedgut, Kalai and Nisan [FKN09] provided a very general result, 
showing that in the case of a neutral social choice function between 3 alternatives even 
a random attempted manipulation is beneficial for a voter with non-negligible probability. 
Adapted to our notation, the main result of [FKN09] can be stated as follows: 

Theorem 1.9 ([FKN09]). There exists a constant $C > 0$ with the following property. Let
$f\colon L_3^n \to [3]$ be a neutral social choice function with $D(f, \mathrm{DICT}) \ge \epsilon$. Then,

$$ P((X, Y) \text{ is a manipulation pair for } f) \ge C \frac{\epsilon^2}{n} \tag{5} $$

where $X \in L_3^n$ is uniformly selected, and $Y$ is obtained from $X$ by uniformly selecting a
coordinate $i \in \{1, \ldots, n\}$ and resetting the $i$'th coordinate to a random preference.

Choosing $X, Y$ randomly as in Theorem 1.9, the result of [FKN09] implies that a manipulation
pair is obtained with non-negligible probability (at most polynomially small in $n$),
and thus a manipulation pair can be found efficiently as long as $f$ can be efficiently evaluated.
Note, however, that the computational problem discussed above is different from the
problem considered in previous work [BO91, CS03, EL05, PR06, CS06], where the complexity
studied was that of finding a beneficial manipulation for a specific voter, given the declared
preferences of all other voters. Since [FKN09] considers only three alternatives, a voter with
access to the social choice function can easily try all permutations of the alternatives to find
a manipulation.

Corollary 1.6 and Corollary 1.8, which extend the result of [FKN09] to the case of 4 or more 
alternatives, are thus more relevant with respect to the hardness of finding a manipulation. 
They imply that in the case where votes are cast uniformly at random, a random change
of preference for a random voter will yield a beneficial manipulation with non-negligible
probability, at most polynomially small in $q$ and $n$ by Corollary 1.8. Thus in the setup
of [BO91, CS03, EL05, PR06, CS06], with positive probability, a single voter with black-box
access to $f$ can efficiently manipulate. This implies that the approach of masking manipulations
behind computational hardness cannot hide manipulations completely.

We note that there are other (independent) extensions of [FKN09] for more candidates. 
Xia and Conitzer [XC08] applied the proof strategy of [FKN09] to show that for some social 
choice functions with n voters and a fixed number m of alternatives, starting with a uniformly 
random voting profile and then randomly resetting the ranking of one of the voters yields a 
manipulation pair with probability $\Omega(1/n)$. Their proof requires a number of properties of the
social choice functions including anonymity (the social choice outcome depends only on the
number of times each order was chosen), homogeneity (if each vote is replaced by $t$ identical
votes the outcome remains the same), and canceling out (this condition is related to neutrality:
it says that one can cancel any subset of the votes which contains each order exactly once).
Most importantly, the results of Xia and Conitzer require that certain outcomes are robust
(will not change if a small linear fraction of the voters cast a specific order), and the result does
not give bounds on the frequency of manipulations in terms of $m$, the number of alternatives.
The latter point implies that the results do not have implications for the hardness of finding
a manipulation in the setup of [BO91, CS03, EL05, PR06, CS06].

We further note that Dobzinski and Procaccia [DP08] established an analogous result for 
the case of two voters and any number of candidates, under a comparably weak assumption 
on the voting rule. 

1.3 Techniques 

The results of [FKN09] are obtained by mixing combinatorial techniques with discrete harmonic
analysis. In contrast, our techniques are purely geometric and combinatorial. In
particular, we apply a variant of the canonical path method to prove isoperimetric bounds
of "second order". These allow us to establish the existence of a large interface where 3 bodies
touch. As far as we know, our result is the first one to establish such a bound in any context.

The canonical path method. Before describing our techniques, we briefly recall the
canonical path method [JS90]. Given a graph $G$ and a subset $A$ of its vertices, a general
approach to proving a lower bound on the 'surface area' of $A$ (namely the number of vertices
in $A$ that are attached by an edge to a vertex outside of $A$) is as follows: for each pair $x, y$
of vertices in $G$ such that $x \in A$ and $y \notin A$, determine a path in $G$ between them, called the
canonical path between $x$ and $y$. Since $x$ is in $A$ and $y$ is not, there is at least one surface
vertex on each canonical path. So if one manages to prove that each surface vertex lies on at
most $r$ canonical paths, it immediately follows that the surface of $A$ contains at least $|A| \cdot |A^c| / r$
vertices, where $A^c$ is the complement of $A$, giving the required lower bound on the surface area of $A$.
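
The counting argument can be tried out on a toy graph. The sketch below is only an illustration and is not taken from the paper: it assumes the hypercube $\{0,1\}^d$ as the graph and "bit-fixing" paths as the canonical paths, and the names `bit_fixing_path` and `surface_bound` are hypothetical. It returns the surface size, the maximal load $r$, and the number of pairs $|A| \cdot |A^c|$, so that the inequality surface $\cdot\, r \ge |A| \cdot |A^c|$ can be inspected directly.

```python
from itertools import product

def bit_fixing_path(x, y):
    """Canonical path in the hypercube: correct the coordinates of x
    one at a time, left to right, until y is reached."""
    path, cur = [x], list(x)
    for k in range(len(x)):
        if cur[k] != y[k]:
            cur[k] = y[k]
            path.append(tuple(cur))
    return path

def surface_bound(A, d):
    """Return (|surface of A|, r, |A| * |A^c|) for a proper, non-empty
    subset A of {0,1}^d, where r is the largest number of canonical
    paths passing through a single surface vertex."""
    cube = set(product((0, 1), repeat=d))
    outside = cube - A

    def neighbours(v):
        return [v[:k] + (1 - v[k],) + v[k + 1:] for k in range(d)]

    surface = {v for v in A if any(w in outside for w in neighbours(v))}
    load = dict.fromkeys(surface, 0)
    for x in A:
        for y in outside:
            for v in bit_fixing_path(x, y):
                if v in surface:
                    load[v] += 1
    return len(surface), max(load.values()), len(A) * len(outside)
```

Every path from $A$ to its complement contains a last vertex in $A$, which is a surface vertex, so the total load is at least the number of pairs; this is exactly why the product of the first two returned values dominates the third.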

Manipulation paths. Think of the graph $G$ having the set $L_q^n$ of all ranking profiles as
the vertex set, where the pair $(x, y)$ is an edge if $x$ and $y$ differ on at most one coordinate.
A social choice function $f\colon L_q^n \to [q]$ naturally partitions the vertices of $G$ into $q$ subsets.
Our main interest is not in the surface area of these subsets, however, but in the number of
manipulation points.

Our approach in the proof of Theorem 1.5 is therefore the following: we consider four subsets
$f^{-1}(A)$, $f^{-1}(B)$, $f^{-1}(C)$ and $f^{-1}(D)$, where the outcome is $A$, $B$, $C$ and $D$ respectively.
We first use elementary methods to show that many edges in our graph lie on the interface
between $f^{-1}(A)$ and $f^{-1}(B)$, namely have one vertex from each of the subsets. Similarly,
many edges must lie on the interface between $f^{-1}(C)$ and $f^{-1}(D)$.

We then define a so-called manipulation path for each pair of edges consisting of one
edge on the interface between $f^{-1}(A)$ and $f^{-1}(B)$, and one on the interface between $f^{-1}(C)$
and $f^{-1}(D)$. The path (of edges) has the property that it either stays in one interface or
the other. If a path "transitions" from the interface between $f^{-1}(A)$ and $f^{-1}(B)$ to the
interface between $f^{-1}(C)$ and $f^{-1}(D)$, then around the transition point the function must
obtain at least 3 values. This realization allows us to apply the original Gibbard-Satterthwaite
theorem and associate a manipulation point with the path. Much of the work is then devoted
to bounding the number of paths that can correspond to each manipulation point.
A refined geometry. To obtain the improved parameters of Theorem 1.7 we use a proof
scheme similar to that of Theorem 1.5; however, we use an underlying graph with a different
edge structure. Instead of connecting every pair $x, y \in L_q^n$ of ranking profiles that differ in
just one coordinate, we connect $x$ and $y$ only if in the coordinate $i$ in which they differ, $y_i$
can be obtained from $x_i$ by a single transposition. In the case where $n = 1$ this is the graph
that is studied in the analysis of the adjacent transposition card shuffling [Ald83, Wil04]. The
proof of the refined result requires showing that geometric and combinatorial quantities such
as boundaries and manipulation points are roughly the same in the refined graph as in the
original graph on $L_q$. This proof requires the development of a number of techniques, in
particular the study of canonical paths under group actions.

1.4 Organization of the paper 

In Section 2 we set some notations, definitions, and some general observations. We prove 
Theorem 1.5 in Sections 3, 4 and 5. Theorem 1.7 is proved in Sections 6, 7, and 8. Finally, 
some open problems appear in Section 9. 

2 Setup and notation 

Rankings. We denote by $L_q$ the set of rankings of $q$ alternatives. An element $x \in L_q$ is a
permutation of the set $[q]$. The element ranked at the top by $x$ is $x(1)$, the second is $x(2)$ etc.
Given another element $y \in L_q$, their composition $yx$ is the ranking where the element ranked
at the top is $y(x(1))$ etc.

More generally we will also sometimes use $L_S$ to denote the set of rankings of a set $S$.

Definition 2.1 (neutral social choice functions). Let $f\colon L_q^n \to [q]$ be a social choice function.
We say that $f$ is neutral if for every $x \in L_q^n$ and every $y \in L_q$, $y(f(x)) = f(yx_1, \ldots, yx_n)$.
Informally, $f$ is neutral if the names of the alternatives do not matter when applying $f$.
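
Neutrality can be verified mechanically on small instances. The following sketch is an illustration under our own conventions (0-based tuples with the top-ranked alternative first, and the hypothetical helper names `compose` and `is_neutral`); it checks the identity $y(f(x)) = f(yx_1, \ldots, yx_n)$ over all profiles and all relabelings.

```python
from itertools import permutations, product

def compose(y, x):
    """Ranking yx: the alternative in position k is y(x(k))."""
    return tuple(y[a] for a in x)

def is_neutral(f, n, q):
    """Brute-force check that relabeling the alternatives by y
    relabels the outcome of f in the same way."""
    rankings = list(permutations(range(q)))
    return all(
        y[f(x)] == f(tuple(compose(y, xi) for xi in x))
        for x in product(rankings, repeat=n)
        for y in rankings
    )
```

A dictator passes this check, while a constant function fails it, since relabeling moves the constant outcome.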

Influences and Variance. We call a function $f\colon L_q^n \to [q]$ a social choice function and
define the influence of the $i$:th coordinate on $f$ as $\mathrm{Inf}_i(f) = P(f(X) \ne f(X^{(i)}))$ where $X$ is
uniform on $L_q^n$ and $X^{(i)}$ is obtained from $X$ by re-randomizing the $i$:th coordinate. Similarly
we define the influence of the $i$:th coordinate w.r.t. a single alternative $a \in [q]$ or a pair of
alternatives $a, b \in [q]$ as

$$ \mathrm{Inf}_i^{a}(f) = P(f(X) = a,\ f(X^{(i)}) \ne a) $$

and

$$ \mathrm{Inf}_i^{a,b}(f) = P(f(X) = a,\ f(X^{(i)}) = b) $$

respectively.

We also define the total influence of $f$ as $\mathrm{Inf}(f) = \sum_{i=1}^{n} \mathrm{Inf}_i(f)$. The following relationship
is obvious.

Proposition 2.2. For any $f\colon L_q^n \to [q]$,

$$ \mathrm{Inf}_i(f) = \sum_{a=1}^{q} \mathrm{Inf}_i^{a}(f) = \sum_{a,b \in [q]\colon a \ne b} \mathrm{Inf}_i^{a,b}(f) \tag{6} $$
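
The influences can be computed exactly on small instances by enumerating every profile together with every possible re-randomized coordinate. The sketch below is illustrative; the tuple encoding of rankings and the names `pair_influences` and `total_from_pairs` are assumptions made for this example.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations, product

def pair_influences(f, n, q, i):
    """Exact joint law of (f(X), f(X^(i))): entry (a, b) with a != b
    is the pair influence Inf_i^{a,b}(f)."""
    rankings = list(permutations(range(q)))
    weight = Fraction(1, len(rankings) ** (n + 1))
    inf = Counter()
    for x in product(rankings, repeat=n):
        for r in rankings:  # r is the fresh value of coordinate i
            y = x[:i] + (r,) + x[i + 1:]
            inf[f(x), f(y)] += weight
    return inf

def total_from_pairs(inf):
    """Inf_i(f) as the sum of the pair influences over a != b."""
    return sum(v for (a, b), v in inf.items() if a != b)
```

For a dictator on the first coordinate with $q = 3$, the first coordinate has influence $2/3$ (the probability that a fresh ranking has a different top choice) and every other coordinate has influence 0.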
The following standard proposition bounds the total influence with respect to a given
candidate from below by the variance with respect to that candidate. 

Proposition 2.3. For any $f\colon L_q^n \to [q]$ and $a \in [q]$,

$$ \sum_{i=1}^{n} \mathrm{Inf}_i^{a}(f) \ge \mathrm{Var}[1_{\{f(X)=a\}}] \tag{7} $$

where $X \in L_q^n$ is uniformly selected.

Proof. Create a random walk $X = X^{(0)}, \ldots, X^{(n)} = Y$ from $X$ by re-randomizing the $i$:th
coordinate in the $i$:th step, i.e. for $i \in [n]$, $X^{(i)} \in L_q^n$ is obtained by re-randomizing the $i$:th
coordinate of $X^{(i-1)}$. Letting $g(x) = 1_{\{f(x)=a\}}$ and using that $X, Y$ are independent and that
if $g(X) \ne g(Y)$ then the value of $g$ has to change at some edge on the path, we have

$$ 2\,\mathrm{Var}[1_{\{f(X)=a\}}] = 2\,\mathrm{Var}\, g(X) = P(g(X) \ne g(Y)) \le P\Big(\bigcup_{i \in [n]} \{ g(X^{(i-1)}) \ne g(X^{(i)}) \}\Big) \le \sum_{i=1}^{n} 2\,\mathrm{Inf}_i^{a}(f). $$

□

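Further, if a function is far from all constants, all such variances cannot be small:

Lemma 2.4. For any $f\colon L_q^n \to [q]$,

$$ D(f, \mathrm{CONST}) \le \frac{q}{2} \sum_{a=1}^{q} \mathrm{Var}[1_{\{f(X)=a\}}] \tag{8} $$

Proof. For $a \in [q]$, let $\mu_a = P(f(X) = a)$ and assume w.l.o.g. that $\mu_1 \ge \mu_2 \ge \ldots \ge \mu_q$.
Then,

$$ D(f, \mathrm{CONST}) = (1 - \mu_1) \le q\mu_1(1 - \mu_1) = \frac{q}{2}\big(1 - \mu_1^2 - (1 - \mu_1)^2\big) \le \frac{q}{2}\Big(1 - \sum_{a=1}^{q} \mu_a^2\Big) = \frac{q}{2}\sum_{a=1}^{q} \mu_a(1 - \mu_a) = \frac{q}{2}\sum_{a=1}^{q} \mathrm{Var}[1_{\{f(X)=a\}}]. $$

□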
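
As a sanity check, the two variance bounds above can be tested numerically on small instances. The sketch below is an illustration only: it reads the bounds as $\sum_i \mathrm{Inf}_i^a(f) \ge \mathrm{Var}[1_{\{f(X)=a\}}]$ and $D(f, \mathrm{CONST}) \le \frac{q}{2}\sum_a \mathrm{Var}[1_{\{f(X)=a\}}]$ (the constant $q/2$ is our reading of the garbled source), and the function name `check_variance_bounds` and the tuple encoding of rankings are assumptions.

```python
from fractions import Fraction
from itertools import permutations, product

def check_variance_bounds(f, n, q):
    """Return (prop_2_3_holds, lemma_2_4_holds) for f by exact enumeration."""
    rankings = list(permutations(range(q)))
    profiles = list(product(rankings, repeat=n))
    N = len(profiles)
    # mu[a] = P(f(X) = a); the indicator of {f = a} has variance mu(1 - mu).
    mu = [Fraction(sum(f(x) == a for x in profiles), N) for a in range(q)]
    var = [m * (1 - m) for m in mu]
    dist_const = 1 - max(mu)
    # inf[a] accumulates sum_i P(f(X) = a, f(X^(i)) != a).
    inf = [Fraction(0)] * q
    for x in profiles:
        a = f(x)
        for i in range(n):
            changed = sum(f(x[:i] + (r,) + x[i + 1:]) != a for r in rankings)
            inf[a] += Fraction(changed, N * len(rankings))
    prop_2_3 = all(inf[a] >= var[a] for a in range(q))
    lemma_2_4 = dist_const <= Fraction(q, 2) * sum(var)
    return prop_2_3, lemma_2_4
```

For a dictator with $q = 3$ the first bound holds with equality ($2/9$ on both sides for every alternative), which shows that it cannot be improved in general.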



3 Boundaries 

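Lemma 3.1. Fix $q \ge 3$ and $f\colon L_q^n \to [q]$ satisfying $D(f, \mathrm{NONMANIP}) \ge \epsilon$. Then there exist
distinct $i, j \in [n]$ and $\{a,b\}, \{c,d\} \subseteq [q]$ such that $c \notin \{a,b\}$ and

$$ \mathrm{Inf}_i^{a,b}(f) \ge \frac{2\epsilon}{nq^2(q-1)} \quad\text{and}\quad \mathrm{Inf}_j^{c,d}(f) \ge \frac{2\epsilon}{nq^2(q-1)} \tag{9} $$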
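Proof. For $a \ne b$ let $A^{a,b} = \{ i \in [n] \mid \mathrm{Inf}_i^{a,b}(f) \ge \frac{2\epsilon}{nq^2(q-1)} \}$.

We first claim that for all $\{a,b\}$ there exists $\{c,d\}$ such that $\{c,d\} \ne \{a,b\}$ and $A^{c,d} \ne \emptyset$.
Note that $f$ being $\epsilon$-far from taking two values asserts that we can find a $c \notin \{a,b\}$ such that
$1 - \frac{\epsilon}{q} \ge P(f(X) = c) \ge \frac{\epsilon}{q-2} \ge \frac{\epsilon}{q}$. But then, by Proposition 2.3,

$$ \sum_{d \ne c} \sum_{i=1}^{n} \mathrm{Inf}_i^{c,d}(f) = \sum_{i=1}^{n} \mathrm{Inf}_i^{c}(f) \ge \mathrm{Var}[1_{\{f(X)=c\}}] \ge \frac{\epsilon}{q}\Big(1 - \frac{\epsilon}{q}\Big) \ge \frac{\epsilon(q-1)}{q^2}, $$

hence there must exist some $d \ne c$ and $i \in [n]$ such that $\mathrm{Inf}_i^{c,d}(f) \ge \frac{\epsilon}{nq^2} \ge \frac{2\epsilon}{nq^2(q-1)}$, and thus
$A^{c,d} \ne \emptyset$.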

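We next claim that

$$ \Big| \bigcup_{a \ne b} A^{a,b} \Big| \ge 2. \tag{10} $$

To see this, assume the contrary, i.e. $\bigcup_{a \ne b} A^{a,b} \subseteq \{i\}$ for some $i \in [n]$. Then for all $j \ne i$ it
holds that

$$ \mathrm{Inf}_j(f) = \sum_{a \ne b} \mathrm{Inf}_j^{a,b}(f) < q(q-1) \cdot \frac{2\epsilon}{nq^2(q-1)} = \frac{2\epsilon}{nq}. \tag{11} $$

For $\sigma \in L_q$, let $f_\sigma(x) = f(x_1, \ldots, x_{i-1}, \sigma, x_{i+1}, \ldots, x_n)$ and note that for $j \ne i$,

$$ \mathrm{Inf}_j(f) = \frac{1}{q!} \sum_{\sigma \in L_q} \mathrm{Inf}_j(f_\sigma), \tag{12} $$

while $\mathrm{Inf}_i(f_\sigma) = 0$. Hence, by (11), we have

$$ \frac{2\epsilon}{q} > \sum_{j \ne i} \mathrm{Inf}_j(f) = \frac{1}{q!} \sum_{\sigma \in L_q} \sum_{j=1}^{n} \mathrm{Inf}_j(f_\sigma) \ge \frac{1}{q!} \sum_{\sigma \in L_q} \frac{2}{q} D(f_\sigma, \mathrm{CONST}) = \frac{2}{q} D(f, \mathrm{DICT}_i), $$

where the second inequality follows from Lemma 2.4 and Proposition 2.3. But this means
that $f$ is $\epsilon$-close to a dictator, contradicting the assumption that $D(f, \mathrm{NONMANIP}) \ge \epsilon$.

Hence (10) holds. Therefore we can either find $i \ne j$ and $\{a,b\} \ne \{c,d\}$ such that $i \in A^{a,b}$
and $j \in A^{c,d}$, which proves the lemma, or we must have $|A^{a,b}| \ge 2$ for some $\{a,b\}$ while
$A^{c,d} = \emptyset$ for any $\{c,d\} \ne \{a,b\}$. However, this contradicts the first claim in the proof. The
result follows. □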

As a simple corollary, assuming neutrality and $q \ge 4$ we may take $a, b, c, d$
to be all distinct.

Corollary 3.2. Fix $q \ge 4$ and suppose $f\colon L_q^n \to [q]$ is neutral and satisfies $D(f, \mathrm{DICT}) \ge \epsilon$.
Then there exist distinct $i, j \in [n]$ and distinct $a, b, c, d \in [q]$ such that

$$ \mathrm{Inf}_i^{a,b}(f) \ge \frac{\epsilon}{nq^2(q-1)} \quad\text{and}\quad \mathrm{Inf}_j^{c,d}(f) \ge \frac{\epsilon}{nq^2(q-1)} \tag{13} $$

Proof. Neutrality of $f$ implies that $f$ is $1 - 2/q \ge 1/2$ far from the set of functions taking at
most 2 values. Since $\epsilon \le 1$ it follows that $D(f, \mathrm{NONMANIP}) \ge \epsilon/2$. Moreover, by neutrality,
$\mathrm{Inf}_i^{a,b}(f)$ does not depend on $\{a,b\}$, so we can choose $\{a,b\}$ and $\{c,d\}$ non-intersecting. □
4 First Construction of Manipulation Paths

Similar to the definition of influence, let us now define $f$'s boundary in the $i$:th direction
w.r.t. the alternatives $a, b \in [q]$ as

$$ B_i^{a,b}(f) = \{ (x, y) \mid f(x) = a,\ f(y) = b,\ \forall j \ne i\colon x_j = y_j \}. $$

The main idea of the proof is to define a canonical path between every pair of points on
$B_i^{a,b}$ and every pair of points on $B_j^{c,d}$ in a way such that each canonical path passes through
a manipulation point while making sure that no manipulation point can be passed by too
many canonical paths. We call the paths so constructed manipulation paths.

Let us start with defining the canonical paths in terms of one voter. The main intuition
behind the canonical paths is that in order to remain on $B_i^{a,b}$ we require that we change
rankings without changing the relative order of $a$ and $b$. Similarly, in order to remain on $B_j^{c,d}$
we require that we change the ranking without changing the relative order of $c$ and $d$.

We now define the graph that we are working with: 

Definition 4.1. The voting graph is the graph whose vertex set is $L_q^n$ and whose edges are
of the form $(x, y)$ where, for some $i \in [n]$, $x_j = y_j$ for all $j \ne i$ and $x_i \ne y_i$.

We begin our definition of a canonical path by considering the case of one voter. 

Definition 4.2. Fix $q \ge 4$ and distinct $a, b, c, d \in [q]$. Then the canonical path between
$x \in L_q$ and $z \in L_q$ is $x, y, z$ where $y$ is obtained from $z$ by swapping $a$ and $b$ if necessary in
order to assure that $a$ and $b$ are in the same order as in $x$. This first step is called a Type I
move while the second step from $y$ to $z$ is called a Type II move.
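
The one-voter canonical path is simple enough to write out directly. The sketch below is illustrative; rankings are encoded as tuples with the top-ranked alternative first, and the names `swap` and `one_voter_path` are assumptions made here.

```python
def swap(ranking, a, b):
    """Exchange the positions of alternatives a and b in a ranking."""
    return tuple(b if v == a else a if v == b else v for v in ranking)

def one_voter_path(x, z, a, b):
    """Canonical path x, y, z for a single voter: y equals z, except that
    a and b are swapped if needed so that their order matches x."""
    same_order = (x.index(a) < x.index(b)) == (z.index(a) < z.index(b))
    y = z if same_order else swap(z, a, b)
    return x, y, z
```

With `x = (0, 1, 2, 3)`, `z = (3, 1, 2, 0)`, `a = 0` and `b = 1`, the middle ranking is `(3, 0, 2, 1)`: it orders 0 and 1 as `x` does, and it differs from `z` only in the positions of 0 and 1, so the order of any other pair (such as 2 and 3) is the same in `y` and `z`.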

Note that Type I moves preserve the order of $a$ and $b$ while Type II moves preserve the
order of $c$ and $d$. We can now define the manipulation paths used in the first proof. These
paths go from points in $B_i^{a,b}$ to $B_j^{c,d}$. To simplify notation we assume that $i = n - 1$ and
$j = n$. The path is of length $2n$ and is defined by first making all Type I moves and then
making all Type II moves.

Definition 4.3. Let $f\colon L_q^n \to [q]$, $(x, x') \in B_{n-1}^{a,b}$ and $(z, z') \in B_n^{c,d}$, for distinct $a, b, c, d \in [q]$.
Then the canonical path $\Gamma$ between $(x, x')$ and $(z, z')$ is

$$ (x, x') = (x^{(0)}, x'^{(0)}), \ldots, (x^{(n-2)}, x'^{(n-2)}), (z^{(n-2)}, z'^{(n-2)}), \ldots, (z^{(0)}, z'^{(0)}) = (z, z'), $$

where only coordinate $k$ is updated at the $k$:th first step and the $k$:th last step, i.e. for all $k$
and all $s \ne k$:

$$ (x_s^{(k-1)}, x_s'^{(k-1)}) = (x_s^{(k)}, x_s'^{(k)}), \qquad (z_s^{(k-1)}, z_s'^{(k-1)}) = (z_s^{(k)}, z_s'^{(k)}) $$

and

$$ x_k = x_k^{(k-1)}, \quad x_k^{(k)} = z_k^{(k)}, \quad z_k^{(k-1)} = z_k $$

$$ x_k' = x_k'^{(k-1)}, \quad x_k'^{(k)} = z_k'^{(k)}, \quad z_k'^{(k-1)} = z_k' $$

are the canonical paths in Definition 4.2.
5 Manipulation Points and First Proof

Lemma 5.1. For any $f\colon L_q^n \to [q]$, distinct $i, j \in [n]$ and distinct $a, b, c, d \in [q]$ there exists
a mapping $h\colon B_i^{a,b}(f) \times B_j^{c,d}(f) \to M$ where

$$ M = \{ x \in L_q^n \mid f \text{ is manipulable at } x \} $$

such that for any $x \in M$

$$ |h^{-1}(x)| \le 2n(q!)^{n+4}. \tag{14} $$

Proof. Without loss of generality, let $i = n - 1$ and $j = n$. Fix $(x, x') \in B_i^{a,b}$ and $(z, z') \in B_j^{c,d}$.
Any edge on the canonical path between $(x, x')$ and $(z, z')$ connects two pairs of points. The
left-most pair takes the values $(a, b)$ since $f(x) = a$ and $f(x') = b$ while the right-most
pair takes the values $(c, d)$. We claim that somewhere on the path there will be an edge
$(u,u'), (v,v')$ such that either

I. at least one of u,u',v,v' is a manipulation point. 

II. $f$ takes on at least three values on the points $u,u',v,v'$.

To see this note that at least one of three things must happen: 

1. Somewhere along the first half of the path the values of the pair change from $(a,b)$
to something else. If the first value changes to $b$ then $f(x^{(k)}) = a$ and $f(x^{(k+1)}) = b$,
but since the order of $a, b$ is preserved under Type I moves either $x^{(k)}$ or $x^{(k+1)}$ must
be a manipulation point. A similar logic applies when the second value changes to $a$.
Otherwise, one of the values is not in $\{a, b\}$ and therefore $f$ takes on at least three
values on the two pairs of this edge.

2. Somewhere along the second half of the path (starting from the end) the values of the
pair change from $(c, d)$ to something else. If the first value changes to $d$ or the second
value changes to $c$ we have a manipulation point since the order of $c, d$ is preserved
under Type II moves. Otherwise, one of the values is not in $\{c, d\}$.

3. The middle edge $(x^{(n-2)}, x'^{(n-2)}), (z^{(n-2)}, z'^{(n-2)})$ connects a pair with values $(a, b)$ and
a pair with values $(c, d)$.

Let $(u,u'),(v,v')$ be the first edge where one of I. or II. holds and note that $u,u',v,v'$
agree in all but two coordinates, either $\{n-1, k\}$, $\{n, k\}$ or $\{n, n-1\}$ depending on whether
the edge $(u,u'), (v,v')$ is on the first part of the path, the second part or is the middle edge.

We now claim that we can find a manipulation point $y$ such that $u,u',v,v'$ and $y$ agree
in all but two coordinates. We will let $h((x,x'), (z,z'))$ be this $y$.

For case I. this is obvious and we can let $y$ be any of $u, u', v, v'$ which is a manipulation
point.

For case II., by applying the Gibbard-Satterthwaite theorem (Theorem 1.2) to the restriction
of $f$ to the two coordinates on which $u, u', v, v'$ differ we can identify a manipulation point
$y \in L_q^n$ which only differs from $u,u',v,v'$ on these two coordinates and is also a manipulation
point of the original function $f$ (if there is more than one possible manipulation point we can
just pick, say, the lexicographically smallest one).

It remains to count the number of inverses of a manipulation point $y$ associated with the
edge $(u,u'), (v,v')$ which can be any of the $2n - 3$ edges of the canonical path. Given the
edge number and $y$, there are only $(q!)^2$ possibilities for $u$. Given $u$ and the edge number
there are only $(q!)^n$ possibilities for $x$ and $z$. To see this note that for each $k \in [n]$ we must
have either

• $u_k = x_k$. In this case there are $q!$ possibilities for $z_k$.

• $u_k = z_k$. In this case there are $q!$ possibilities for $x_k$.

• $x_k, u_k, z_k$ is the canonical path from Definition 4.2 between $x_k$ and $z_k$. Then there are
$q!/2$ possibilities for $x_k$ and 2 possibilities for $z_k$.

Finally, given $x$ and $z$ there are at most $(q!)^2$ possibilities for $x'$ and $z'$. Overall we have:

$$ |h^{-1}(y)| \le (2n - 3)(q!)^{n+4} \tag{15} $$

□

Proof of Theorem 1.5. By Corollary 3.2 we can find distinct $i, j \in [n]$ and distinct $a, b, c, d \in [q]$
such that

$$ |B_i^{a,b}(f)| \ge \frac{\epsilon}{nq^2(q-1)} (q!)^{n+1} \quad\text{and}\quad |B_j^{c,d}(f)| \ge \frac{\epsilon}{nq^2(q-1)} (q!)^{n+1}. \tag{16} $$

Applying Lemma 5.1 we see that

$$ |M| \ge \frac{|B_i^{a,b}(f) \times B_j^{c,d}(f)|}{2n(q!)^{n+4}} \ge \frac{\epsilon^2}{2n^3 q^4 (q-1)^2 (q!)^2} (q!)^n \ge \frac{\epsilon^2}{2n^3 q^6 (q!)^2} (q!)^n. \tag{17} $$

Hence,

$$ P(f \text{ is manipulable at } X) \ge \frac{\epsilon^2}{2n^3 q^6 (q!)^2}. \tag{18} $$

□



6 Canonical Paths and Group Actions 

In order to derive the more refined result, we will need to consider in more detail the properties 
of the permutation group L q with respect to adjacent transpositions. Again we use canonical 
paths arguments. We state the arguments in a more general setup. 

Definition 6.1. Let $L$ be a set.

• Let $P_L(\ell)$ denote the set of paths of length at most $\ell$ in $L$ and $P_L = \bigcup_{\ell \in \mathbb{N}} P_L(\ell)$ the set
of paths of finite length.

• Let $L_1, L_2 \subseteq L$. A canonical path map on $L$ from $L_1$ to $L_2$ of length $\ell$ is a map
$\Gamma\colon L_1 \times L_2 \to P_L(\ell)$ which satisfies that $\Gamma(x,y)$ begins at $x$ and ends at $y$ for all
$(x,y) \in L_1 \times L_2$.

• Given a canonical path map $\Gamma\colon L_1 \times L_2 \to P_L(\ell)$ and $0 \le i \le \ell$ we define the inverse
image mapping of the $i$'th vertex, $\Gamma_i^{-1}\colon L \to 2^{L_1 \times L_2}$, as

$$ \Gamma_i^{-1}(z) = \{ (x,y) \mid \mathrm{length}(\Gamma(x,y)) \ge i,\ \Gamma(x,y)_i = z \}. $$

Further, we let

$$ \Gamma^{-1}(z) = \bigcup_{i=0}^{\ell} \Gamma_i^{-1}(z). $$

• Given a group $H$ acting on $L$ we say that a canonical path map $\Gamma\colon L_1 \times L_2 \to P_L(\ell)$ is
$H$-invariant if $HL_1 = L_1$ and $HL_2 = L_2$ and

$$ \Gamma(hx, hy) = h\Gamma(x,y), $$

for all $h \in H$ and all $(x, y) \in L_1 \times L_2$.

We will use the following proposition. Recall that a group $H$ acting on $L$ is called
fixed-point-free if for all $x \in L$ and all $h \in H$ different from the identity it holds that $hx \ne x$.

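Proposition 6.2. Let $H$ be a fixed-point-free group acting on $L$ and let $\Gamma\colon L_1 \times L_2 \to P_L(\ell)$
be a canonical path map that is $H$-invariant. Then for all $z \in L$ and $0 \le i \le \ell$ it holds that

$$ |\Gamma_i^{-1}(z)| \le \frac{|L_1||L_2|}{|H|} \tag{19} $$

and

$$ |\Gamma^{-1}(z)| \le (\ell + 1)\frac{|L_1||L_2|}{|H|}. \tag{20} $$

Proof. Note that for all $i$,

$$ |L_1 \times L_2| \ge \sum_{w \in L} |\Gamma_i^{-1}(w)| \ge \sum_{h \in H} |\Gamma_i^{-1}(hz)| = |H|\,|\Gamma_i^{-1}(z)|, $$

where the first inequality follows since the value of the $i$'th vertex partitions the set of paths
of length at least $i$, the second inequality since $H$ is fixed-point-free (so the points $hz$, $h \in H$,
are distinct), and the final equality from the path map being $H$-invariant. We thus obtain:

$$ |\Gamma_i^{-1}(z)| \le \frac{|L_1||L_2|}{|H|}, $$

which is (19); taking the union over $0 \le i \le \ell$ gives (20), as needed. □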
Two applications of the result above will be given for adjacent transpositions. 
Definition 6.3. Given two elements $a, b \in [q]$ the adjacent transposition $[a : b]$ between them
is defined as follows. If $x \in L_q$ has $a$ and $b$ adjacent, then $[a : b]x$ is obtained from $x$ by
exchanging $a$ and $b$. Otherwise, $[a : b]x = x$.

We let $T$ denote the set of all $q(q-1)/2$ adjacent transpositions. Given $z \in T$, we define

$$ \mathrm{Inf}_i^{a,b;z}(f) = P(f(X) = a,\ f(X^{(i)}) = b) \tag{21} $$

$$ \mathrm{Inf}_i^{a;z}(f) = P(f(X) = a,\ f(X^{(i)}) \ne a) \tag{22} $$

$$ \mathrm{Inf}_i^{a,b;T}(f) = \sum_{z \in T} \mathrm{Inf}_i^{a,b;z}(f) \tag{23} $$

where $X^{(i)}$ is obtained from $X$ by re-randomizing the $i$:th coordinate $X_i$ in the following way:
with probability 1/2 we keep it as $X_i$ and otherwise we replace it by $zX_i$.

Finally, for $x \in L_q^n$ we will let $[a : b]_i x$ denote the element obtained by applying $[a : b]$ on
the $i$:th coordinate of $x$ while leaving all other coordinates unchanged.

Proposition 6.4. There exists a canonical path map $\Gamma\colon L_q \times L_q \to P_{L_q}(\ell)$ of length at most
$\ell = q(q-1)/2 < q^2/2$, all of whose edges are adjacent transpositions, such that for all $z$ it
holds that:

$$|\Gamma^{-1}(z)| \le \frac{q^2\, q!}{2} \tag{24}$$

Proof. Given $x, y \in L_q$ consider the following canonical path starting at $x$ and ending at $y$.
Take the element $y(1)$ ranked at the top for $y$ and bubble it to the top by performing adjacent
transpositions. Then take the element $y(2)$ ranked second for $y$ and bubble it to the second
position etc. Clearly the length of the path is at most $q(q-1)/2$. Let $H = \{x \mapsto \rho x \mid \rho \in L_q\}$
be the group of compositions with all possible permutations of the candidates. Since $H$ is a
fixed-point-free group acting on $L_q$ and the described canonical path map is $H$-invariant the
result follows from Proposition 6.2. □
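
The bubbling construction in the proof can be checked by brute force for small $q$. The sketch below is only an illustration under assumed conventions (rankings as tuples with the top candidate first; the names `bubble_path` and `path_statistics` are hypothetical): it builds the path for every pair $(x,y)$, records the longest path, and counts how many pairs route through each vertex, which is $|\Gamma^{-1}(z)|$.

```python
from collections import Counter
from itertools import permutations
from math import factorial

def bubble_path(x, y):
    """Vertices of the canonical path from ranking x to ranking y."""
    cur = list(x)
    path = [tuple(cur)]
    for target, elem in enumerate(y):
        pos = cur.index(elem)
        while pos > target:  # bubble elem upwards, one adjacent swap at a time
            cur[pos - 1], cur[pos] = cur[pos], cur[pos - 1]
            pos -= 1
            path.append(tuple(cur))
    return path

def path_statistics(q):
    """Return (longest path length, max number of pairs (x, y) through a vertex)."""
    rankings = list(permutations(range(q)))
    through = Counter()
    longest = 0
    for x in rankings:
        for y in rankings:
            path = bubble_path(x, y)
            longest = max(longest, len(path) - 1)
            through.update(set(path))  # count each pair once per visited vertex
    return longest, max(through.values())

q = 4
longest, congestion = path_statistics(q)
length_ok = longest <= q * (q - 1) // 2
congestion_ok = 2 * congestion <= q * q * factorial(q)  # bound (24)
```

The exhaustive loop visits $(q!)^2$ pairs, so it is only practical for very small $q$; it is meant as a sanity check of the statement, not as part of the proof.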

Corollary 6.5. For any $f\colon L_q^n \to [q]$, $a \in [q]$ and $i \in [n]$ it holds that

$$\sum_{z \in T} \mathrm{Inf}_i^{a;z}(f) \ge \frac{1}{q^2}\,\mathrm{Inf}_i^{a}(f), \tag{25}$$

where $T$ is the set of all adjacent transpositions.

Proof. This is a standard canonical path argument. Since both sides of the desired inequality
involve averaging over all coordinates but the i'th coordinate, it suffices to
prove the claim in the case where $i = n = 1$. Let
$B = \{(u,v) \in L_q \times L_q \mid f(u) = a \ne f(v),\ \exists z \in T : v = zu\}$ and note that

$$\sum_{z \in T} \mathrm{Inf}_i^{a;z}(f) = \frac{|B|}{2\,q!}. \tag{26}$$

Consider the canonical path map $\Gamma$ constructed in Proposition 6.4. Note that each canonical
path between an element in $A := \{x \in L_q \mid f(x) = a\}$ and an element in $A^c$ must pass via
one of the edges in $B$. Define $h\colon A \times A^c \to B$ by letting $h(x,y)$ be the first edge in $B$ which
$\Gamma(x,y)$ passes through. Then by (24), for any $(u,v) \in B$,

$$|h^{-1}(u,v)| \le |\Gamma^{-1}(u)| \le \frac{q^2\, q!}{2}. \tag{27}$$

Thus

$$|B| \ge \frac{2|A||A^c|}{q^2\, q!}. \tag{28}$$

Combining (26) and (28) we obtain:

$$\sum_{z \in T} \mathrm{Inf}_i^{a;z}(f) \ge \frac{1}{2\,q!}\cdot\frac{2|A||A^c|}{q^2\, q!} = \frac{1}{q^2}\cdot\frac{|A|}{q!}\cdot\frac{|A^c|}{q!} = \frac{1}{q^2}\,\mathrm{Inf}_i^{a}(f).$$

□
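
For a single voter the two sides of (25) can be computed exactly, which gives a quick numerical check of the argument. The sketch below is an illustration only: it assumes, as the last display of the proof suggests, that for $n = 1$ the quantity $\mathrm{Inf}^a(f)$ equals $|A||A^c|/(q!)^2$, and it uses equation (26) for the left-hand side. The function names and the dictionary encoding of $f$ are hypothetical.

```python
import random
from fractions import Fraction
from itertools import permutations
from math import factorial

def swap_neighbours(x):
    """Rankings obtained from x by exchanging two adjacent candidates."""
    return [x[:k] + (x[k + 1], x[k]) + x[k + 2:] for k in range(len(x) - 1)]

def both_sides(f, q, a):
    """Return (sum over z of Inf^{a;z}(f), Inf^a(f) / q^2) for one voter."""
    rankings = list(permutations(range(q)))
    A = [x for x in rankings if f[x] == a]
    # |B|: ordered pairs (u, v) with f(u) = a != f(v), v one adjacent swap from u
    size_B = sum(1 for u in A for v in swap_neighbours(u) if f[v] != a)
    lhs = Fraction(size_B, 2 * factorial(q))  # equation (26)
    inf_a = Fraction(len(A) * (len(rankings) - len(A)), factorial(q) ** 2)
    return lhs, inf_a / q ** 2

# A randomly chosen social choice function of one voter and q = 3 candidates.
random.seed(0)
q = 3
f = {x: random.randrange(q) for x in permutations(range(q))}
lhs, rhs = both_sides(f, q, a=0)
inequality_holds = lhs >= rhs
```

Exact rational arithmetic via `Fraction` avoids any floating-point doubt when the two sides are close.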



A second application of Proposition 6.4 is the following. 

Proposition 6.6. Fix two elements $a, b \in [q]$ and let $B \subseteq L_q$ denote the set of all permutations
where $a$ is ranked above $b$. Then there exists a canonical path map $\Gamma\colon B \times B \to P_B(q^2)$
consisting of adjacent transpositions such that all permutations along the path satisfy that $a$
is ranked above $b$. Moreover for all $z$ it holds that:

$$|\Gamma^{-1}(z)| \le q^4\, q!$$

Proof. $\Gamma(x,y)$ is defined as follows. We look at all elements different from $a, b$, starting with
the top one of $y$, and bubble each of them upwards to its position in $y$ ignoring $a, b$. After
we have done so, we have all elements but $a, b$ ordered as in $y$, followed by $a$, followed by $b$.
We now bubble $a$ to its location in $y$ and then bubble $b$. Note that the length of the path so
defined is at most

$$\frac{q(q-1)}{2} + 2(q-1) = \frac{(q+4)(q-1)}{2} \le q^2.$$

The proof now follows from Proposition 6.2 by considering the group $H$ which acts by
permuting arbitrarily all elements but those labeled by $a$ and $b$:

$$|\Gamma^{-1}(z)| \le (q^2+1)\,\frac{(q!/2)^2}{(q-2)!} \le q^4\, q!.$$

□
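
The path of Proposition 6.6 can be sketched in a few lines and checked exhaustively for small $q$. The code below is a hedged illustration (tuple encoding of rankings and the name `restricted_path` are assumptions): it first orders the elements other than $a, b$ as in $y$, then bubbles $a$ and finally $b$ into place.

```python
from itertools import permutations

def restricted_path(x, y, a, b):
    """Path from x to y by adjacent swaps, assuming a is above b in both."""
    cur = list(x)
    path = [tuple(cur)]

    def bubble_up(elem, target):
        pos = cur.index(elem)
        while pos > target:
            cur[pos - 1], cur[pos] = cur[pos], cur[pos - 1]
            pos -= 1
            path.append(tuple(cur))

    others = [e for e in y if e not in (a, b)]
    for k, e in enumerate(others):  # order everything except a, b as in y
        bubble_up(e, k)
    bubble_up(a, y.index(a))        # now the ranking ends with a, b
    bubble_up(b, y.index(b))
    return path

def check(q, a, b):
    """Verify endpoint, length bound q^2 and the invariant 'a above b'."""
    above = [x for x in permutations(range(q)) if x.index(a) < x.index(b)]
    for x in above:
        for y in above:
            path = restricted_path(x, y, a, b)
            if path[-1] != y or len(path) - 1 > q * q:
                return False
            if any(v.index(a) > v.index(b) for v in path):
                return False
    return True

all_good = check(4, 0, 1)
```

Since $a$ and $b$ are never exchanged with each other, every vertex of the path stays inside $B$.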

7 Refined Boundaries 

Similarly to the previous construction we now define the i:th a-b boundary with respect to
an adjacent swap $z \in T$ as

$$B_i^{a,b;z}(f) = \{(x,y) \mid f(x) = a,\ f(y) = b,\ x_i = z y_i,\ \forall j \ne i : x_j = y_j\},$$

and the boundary with respect to arbitrary adjacent swaps on the i:th coordinate as

$$B_i^{a,b;T}(f) = \bigcup_{z \in T} B_i^{a,b;z}(f).$$

Note that for $a \ne b$,

$$\mathrm{Inf}_i^{a,b;z}(f) = \frac{1}{2} P\big(f(X) = a,\ f(zX) = b\big) = \frac{|B_i^{a,b;z}(f)|}{2\,(q!)^n}. \tag{29}$$

7.1 Manipulation points on refined boundaries 

The following two lemmas identify manipulation points on these boundaries. 

Lemma 7.1. Fix $f\colon L_q^n \to [q]$, distinct $a, b \in [q]$ and $(x,y) \in B_i^{a,b;T}(f)$. Then either $x_i = [a:b]y_i$
or one of $x$ and $y$ is a 2-manipulation point for $f$.

Proof. Suppose $x_i = [c:d]y_i$ where $\{c,d\} \ne \{a,b\}$. Then an adjacent transposition of $c$
and $d$ will not change the order of $a$ and $b$. Hence $b >_{x_i} a$ iff $b >_{y_i} a$. But then either i)
$f(y) = b >_{x_i} a = f(x)$ and $x$ is a 2-manipulation point or ii) $f(x) = a >_{y_i} b = f(y)$ and $y$ is a
2-manipulation point. □
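
The key step of the proof, that $[c:d]$ with $\{c,d\} \ne \{a,b\}$ never changes the relative order of $a$ and $b$, can be confirmed exhaustively for small $q$. The following sketch is illustrative only; the tuple encoding and the helper names are assumptions.

```python
from itertools import combinations, permutations

def adjacent_transposition(a, b, x):
    """Apply [a : b] to the ranking x (a tuple, top candidate first)."""
    i, j = x.index(a), x.index(b)
    if abs(i - j) != 1:
        return x
    y = list(x)
    y[i], y[j] = y[j], y[i]
    return tuple(y)

def order_preserved(q, a, b):
    """True if every [c : d] with {c, d} != {a, b} keeps the order of a and b."""
    for x in permutations(range(q)):
        for c, d in combinations(range(q), 2):
            if {c, d} == {a, b}:
                continue
            y = adjacent_transposition(c, d, x)
            if (x.index(a) < x.index(b)) != (y.index(a) < y.index(b)):
                return False
    return True

preserved = order_preserved(4, 0, 1)
```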

Lemma 7.2. Fix $f\colon L_q^n \to [q]$ and points $x, y, z \in L_q^n$ such that $(x,y) \in B_i^{a,b;T}$ and $(z,y) \in B_j^{c,b;T}$
where $a, b, c$ are distinct and $i \ne j$. Then there exists a 3-manipulation point $w \in L_q^n$ for $f$
such that $w_k = y_k$ for $k \notin \{i,j\}$ and $w_i$ is equal to $x_i$ or $y_i$ except that the position of $c$ may
be shifted arbitrarily and $w_j$ is equal to $z_j$ or $y_j$ except that the position of $a$ may be shifted
arbitrarily.

Proof. By Lemma 7.1 we must have $x_i = [a:b]y_i$ and $z_j = [c:b]y_j$, or $x$, $y$ or $z$ is a
2-manipulation point in which case we are done.

Now create a new triple $(x',y',z')$ by starting from $(x,y,z)$ and simultaneously in the
i:th coordinate of $x$, $y$ and $z$, bubbling $c$ towards the pair $ab$ until it becomes adjacent to the
pair. Since $c$ is never swapped with $a$ or $b$ during this process Lemma 7.1 implies that for
any intermediate triple $(\tilde x, \tilde y, \tilde z)$ we have $f(\tilde x) = a$, $f(\tilde y) = b$ and $f(\tilde z) \notin \{a,b\}$, or one of $\tilde x$, $\tilde y$
and $\tilde z$ is a 2-manipulation point. But since we also have $\tilde z = [c:b]_j \tilde y$, we must actually have
$f(\tilde z) = c$, or either $\tilde y$ or $\tilde z$ is a 2-manipulation point.

Similarly bubbling $a$ towards the pair $bc$ in coordinate $j$ starting from $(x',y',z')$ gives
us $x'', y'', z''$ all having $a, b, c$ adjacent in coordinates $i$ and $j$ such that $(x'',y'') \in B_i^{a,b;[a:b]}$
and $(z'',y'') \in B_j^{c,b;[c:b]}$. Note that $x'', y'', z''$ are equal except for a reordering of the blocks
containing $a, b, c$ in coordinates $i$ and $j$.

Now arbitrary adjacent swapping of $a, b, c$ in these coordinates of $x''$, $y''$ and $z''$ will keep
the value of $f$ in $\{a,b,c\}$, or give rise to a 2-manipulation point by Lemma 7.1. Thus we
can define a social choice function with 2 voters and 3 candidates $f'\colon L_{\{a,b,c\}}^2 \to \{a,b,c\}$ by
letting $f'(v) = f(g(v))$, where $g(v) \in L_q^n$ is obtained from $x''$ by simply reordering the two
blocks of elements $a, b, c$ in coordinates $i$ and $j$ to match $v_1$ and $v_2$, respectively. Since $f'$ takes
three values and is not a dictator, Gibbard-Satterthwaite (Theorem 1.2) implies that $f'$ has a
manipulation point and hence $f$ has a 3-manipulation point satisfying our requirements. □

7.2 Large Refined Boundaries

Now we possess the right tools to prove the analogue of Lemma 3.1 for refined boundaries. 

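Lemma 7.3. Fix $q \ge 3$ and $f\colon L_q^n \to [q]$ satisfying $D(f, \mathrm{NONMANIP}) \ge \varepsilon$. Let $X$ be
uniformly selected from $L_q^n$. Then either,

$$P(f \text{ is 2-manipulable at } X) \ge \frac{4\varepsilon}{nq^7} \tag{30}$$

or there exist distinct $i, j \in [n]$ and $\{a,b\}, \{c,d\} \subseteq [q]$ such that $c \notin \{a,b\}$ and

$$\mathrm{Inf}_i^{a,b;[a:b]}(f) \ge \frac{2\varepsilon}{nq^7} \quad\text{and}\quad \mathrm{Inf}_j^{c,d;[c:d]}(f) \ge \frac{2\varepsilon}{nq^7}. \tag{31}$$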

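Proof. First, suppose that $\mathrm{Inf}_i^{a,b;z}(f) \ge \frac{2\varepsilon}{nq^7}$ for some $i$, $a \ne b$ and $z \ne [a:b]$. Then by Lemma 7.1
for any pair $(x,x') \in B_i^{a,b;z}(f)$ at least one of $x$ or $x' = zx$ is a 2-manipulation point. Let
$M$ be the set of all such 2-manipulation points. Then

$$|M| \ge |B_i^{a,b;z}(f)| = 2(q!)^n\, \mathrm{Inf}_i^{a,b;z}(f) \ge \frac{4\varepsilon}{nq^7}(q!)^n. \tag{32}$$

Dividing by $(q!)^n$ gives (30). Thus, for the remainder of the proof we may assume that

$$\mathrm{Inf}_i^{a,b;z}(f) < \frac{2\varepsilon}{nq^7} \qquad \forall\, i \in [n],\ \{a,b\} \subseteq [q],\ z \ne [a:b]. \tag{33}$$

Now, for $a \ne b$ let $A^{a,b} = \{ i \in [n] \mid \mathrm{Inf}_i^{a,b;[a:b]}(f) \ge \frac{2\varepsilon}{nq^7} \}$.

We first claim that for all $\{a,b\}$ there exists $\{c,d\}$ such that $\{c,d\} \ne \{a,b\}$ and $A^{c,d} \ne \emptyset$.
Note that $f$ being $\varepsilon$-far from taking two values asserts that we can find a $c \notin \{a,b\}$ such that
$1 - \frac{\varepsilon}{q} \ge P(f(X) = c) \ge \frac{\varepsilon}{q-2} \ge \frac{\varepsilon}{q}$. But then, by Corollary 6.5 and Proposition 2.3,

$$\sum_{w \in T}\sum_{d \ne c}\sum_{i=1}^{n} \mathrm{Inf}_i^{c,d;w}(f) = \sum_{w \in T}\sum_{i=1}^{n} \mathrm{Inf}_i^{c;w}(f) \ge \frac{1}{q^2}\mathrm{Var}\big[1_{\{f(X)=c\}}\big] \ge \frac{\varepsilon}{q^4},$$

hence, since the triple sum has at most $nq^3/2$ terms, there must exist some $w \in T$, $d \ne c$ and
$i \in [n]$ such that $\mathrm{Inf}_i^{c,d;w}(f) \ge \frac{2\varepsilon}{nq^7}$. But by (33) we must have $w = [c:d]$, hence $A^{c,d} \ne \emptyset$.

We next claim that

$$\Big|\bigcup_{a,b} A^{a,b}\Big| \ge 2. \tag{34}$$

To see this, assume the contrary, i.e. $\bigcup_{a,b} A^{a,b} \subseteq \{i\}$ for some $i \in [n]$. Then, by Corollary 6.5,
for all $j \ne i$ it holds that

$$\mathrm{Inf}_j(f) \le q^2 \sum_{z \in T}\sum_{a} \mathrm{Inf}_j^{a;z}(f) = q^2 \sum_{z \in T,\, a,\, b \ne a} \mathrm{Inf}_j^{a,b;z}(f) \le q^2 \cdot \frac{q^4}{2} \cdot \frac{2\varepsilon}{nq^7} = \frac{\varepsilon}{nq}. \tag{35}$$

For $\sigma \in L_q$, let $f_\sigma(x) = f(x_1, \dots, x_{i-1}, \sigma, x_{i+1}, \dots, x_n)$ and note that for $j \ne i$,

$$\mathrm{Inf}_j(f) = \frac{1}{q!}\sum_{\sigma \in L_q} \mathrm{Inf}_j(f_\sigma), \tag{36}$$

while $\mathrm{Inf}_i(f_\sigma) = 0$. Hence, by (35), we have

$$\varepsilon > \sum_{j \ne i} \mathrm{Inf}_j(f) = \frac{1}{q!}\sum_{\sigma \in L_q}\sum_{j} \mathrm{Inf}_j(f_\sigma) \ge \frac{2}{q!}\sum_{\sigma \in L_q} D(f_\sigma, \mathrm{CONST}) = 2D(f, \mathrm{DICT}_i),$$

where the second inequality follows from Lemma 2.4 and Proposition 2.3. But this means
that $f$ is $\varepsilon/2$-close to a dictator, contradicting the assumption that $D(f, \mathrm{NONMANIP}) \ge \varepsilon$.

Hence (34) holds. Therefore we can either find $i \ne j$ and $\{a,b\} \ne \{c,d\}$ such that $i \in A^{a,b}$
and $j \in A^{c,d}$, which proves the lemma, or we must have $|A^{a,b}| \ge 2$ for some $\{a,b\}$ while
$A^{c,d} = \emptyset$ for any $\{c,d\} \ne \{a,b\}$. However, this contradicts the first claim in the proof. The
result follows. □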

As a corollary we have that assuming neutrality and $q \ge 4$ we may assume $a, b, c, d$ are
all distinct.

Corollary 7.4. Fix $q \ge 4$ and suppose $f\colon L_q^n \to [q]$ is neutral and satisfies $D(f, \mathrm{DICT}) \ge \varepsilon$.
Let $X$ be uniformly selected from $L_q^n$. Then either,

$$P(f \text{ is 2-manipulable at } X) \ge \frac{2\varepsilon}{nq^7} \tag{37}$$

or there exist distinct $i, j \in [n]$ and distinct $a, b, c, d \in [q]$ such that

$$\mathrm{Inf}_i^{a,b;[a:b]}(f) \ge \frac{\varepsilon}{nq^7} \quad\text{and}\quad \mathrm{Inf}_j^{c,d;[c:d]}(f) \ge \frac{\varepsilon}{nq^7}. \tag{38}$$

Proof. Neutrality of $f$ implies that $D(f, \mathrm{NONMANIP}) \ge \varepsilon/2$ and that $\mathrm{Inf}_i^{a,b;[a:b]}(f)$ does not depend
on $\{a,b\}$, so we can choose $\{a,b\}$ and $\{c,d\}$ non-intersecting. □



8 Refined Construction of Manipulation Paths 

We now present the second construction of manipulation paths. In this construction edges
along the path will consist of adjacent transpositions instead of general permutations as in
the previous construction. Again we construct manipulation paths between every edge on
$B_i^{a,b;[a:b]}$ and every edge on $B_j^{c,d;[c:d]}$ in a way such that each canonical path passes through
(or "close" to) a manipulation point while making sure that no manipulation point can be
passed by too many canonical paths. We call the paths so constructed refined manipulation
paths. The main goal in the current construction compared to the previous one is to have
better dependency on $q$, i.e. the number of inverse images of each manipulation point should
be $\mathrm{poly}(n)\,\mathrm{poly}(q)\,q!$ instead of $2n(q!)^4 q!$ as in the previous construction.

Let us first give two canonical paths on single coordinates that will be used as building
blocks when constructing the refined canonical paths:

Proposition 8.1. Fix four elements $a, b, c, d \in [q]$. Then there exists a canonical path map
$\Gamma\colon L_q \times L_q \to P_{L_q}(q^2 + 2q)$ with the following properties:

• $\Gamma$ is a concatenation of two paths I and II.

• The edges in I are arbitrary adjacent transpositions except $[a:b]$, thus keeping the order
of $a$ and $b$ fixed.

• The edges in II are arbitrary adjacent transpositions except $[c:d]$, thus keeping the
order of $c$ and $d$ fixed.

• For every $y \in L_q$ there are exactly $q!$ pairs $(x,z) \in L_q \times L_q$ for which the last vertex of
I (first vertex of II) in the path $\Gamma(x,z)$ is equal to $y$.

• For all $y \in L_q$ and $i \ge 0$ we have $|\Gamma_i^{-1}(y)| \le q^4\, q!$.

Proof. First fix $x, z \in L_q$. If the order of $c$ and $d$ is the same in $x$ and $z$ then I has zero edges
and consists only of the point $x$. Otherwise, I swaps the positions of $c$ and $d$ by first bubbling
$c$ to the position of $d$ and then bubbling $d$ back to the original position of $c$. II is constructed
as in Proposition 6.6 while preserving the order of $c$ and $d$.

Note that the length of I and II is at most $2q - 2$ and $q^2$ respectively. Further, fixing
the last point of I to $y$, there are two possibilities for $x$ and $q!/2$ possibilities for $z$. Hence,
exactly $q!$ possible values for $(x,z)$.

Finally, by considering the group $H$ which acts by permuting arbitrarily all elements but
those labeled by $a, b, c$ and $d$ and noting that $|H| = (q-4)!$ it follows from Proposition 6.2
that

$$|\Gamma_i^{-1}(y)| \le \frac{(q!)^2}{(q-4)!} \le q^4\, q!. \tag{39}$$

□
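
Path I of Proposition 8.1 is simple enough to sketch directly. The code below is an illustration under assumed conventions (tuple rankings, hypothetical name `swap_positions_path`): it bubbles $c$ to the position of $d$ and then $d$ back to the original position of $c$, and a brute-force check confirms the length bound $2q-2$ and that every edge moves $c$ or $d$, so no edge is $[a:b]$ when $a, b$ are distinct from $c, d$.

```python
from itertools import permutations

def swap_positions_path(x, c, d):
    """Exchange the positions of c and d in x using adjacent swaps only."""
    cur = list(x)
    path = [tuple(cur)]

    def bubble(elem, target):
        pos = cur.index(elem)
        step = 1 if target > pos else -1
        while pos != target:
            cur[pos], cur[pos + step] = cur[pos + step], cur[pos]
            pos += step
            path.append(tuple(cur))

    pc, pd = cur.index(c), cur.index(d)
    bubble(c, pd)  # c travels to the position of d
    bubble(d, pc)  # d travels back to the original position of c
    return path

def check_path_one(q, c, d):
    for x in permutations(range(q)):
        path = swap_positions_path(x, c, d)
        expected = tuple(d if e == c else c if e == d else e for e in x)
        if path[-1] != expected or len(path) - 1 > 2 * q - 2:
            return False
        for u, v in zip(path, path[1:]):
            moved = {e for e, g in zip(u, v) if e != g}
            if not moved & {c, d}:  # every edge must involve c or d
                return False
    return True

path_one_ok = check_path_one(4, 2, 3)
```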

Proposition 8.2. Fix four elements $a, b, c, d \in [q]$. Let

$$X = \{x \in L_q \mid a, b \text{ are adjacent in } x\}.$$

Then there exists a canonical path map $\Gamma\colon X \times L_q \to P_{L_q}(q^2 + 2q)$ with the following properties:

• $\Gamma$ is a concatenation of three paths I, A and II.

• All edges in I are adjacent transpositions not involving $a$ and $b$, thus keeping the rank
of $a$ and $b$ fixed.

• The edges in II are arbitrary adjacent transpositions except $[c:d]$, thus keeping the
order of $c$ and $d$ fixed.

• A consists of a single edge which is a reordering of a block of exactly the 4 elements
$a, b, c, d$.

• For every $y \in L_q$ there are at most $2q^3\, q!$ pairs $(x,z) \in L_q \times L_q$ for which the last vertex
of I in the path $\Gamma(x,z)$ is equal to $y$. The same holds for the first vertex of II.

• For all $y \in L_q$ and $i \ge 0$ we have $|\Gamma_i^{-1}(y)| \le 2q^3\, q!$.

Proof. Fix $x \in X$ and $z \in L_q$. The path I is constructed by first bubbling the element $c$
towards the block $ab$ until it is adjacent to this block and then doing the same with $d$.

A consists of a single edge which reorders the block of $a, b, c$ and $d$ so that the order
matches that in $z$.

II is constructed as in Proposition 6.6 while preserving the order of $c$ and $d$.
Note that the length of I and II is at most $2q - 1$ and $q^2$ respectively.
Finally, by considering the group $H$ which acts by permuting arbitrarily all elements but
those labeled by $a, b, c$ and $d$ it follows from Proposition 6.2 that

$$|\Gamma_i^{-1}(y)| \le \frac{|X|\,|L_q|}{|H|} = \frac{2(q-1)!\, q!}{(q-4)!} \le 2q^3\, q!. \tag{40}$$

The other properties are easy to verify. □

We are now ready to define the canonical path from $B_i^{a,b;[a:b]}(f)$ to $B_j^{c,d;[c:d]}(f)$. This path
is over $(L_q^n)^2$. If we only consider the first element of each such pair, then the path can
informally be described as being constructed by concatenating three paths I, A and II where
I is constructed by updating one coordinate at a time, using the path I of Proposition 8.1
for each coordinate $k \notin \{i,j\}$, using the path I from Proposition 8.2 for coordinate $i$ and
finally for coordinate $j$ using the reverse of the path II of Proposition 8.2 where the role of
elements $a, b$ has been interchanged with that of $c, d$. The path A does the middle step from
Proposition 8.2 for both $i$ and $j$. The path II then updates each coordinate again using the
remaining part of each path above.

Proposition 8.3. Fix four distinct elements $a, b, c, d \in [q]$ and distinct $i, j \in [n]$. Let

$$X = \{(x,x') \in (L_q^n)^2 \mid x' = [a:b]_i x,\ x' \ne x\}$$

and

$$Z = \{(z,z') \in (L_q^n)^2 \mid z' = [c:d]_j z,\ z' \ne z\}.$$

Then there exists a canonical path map $\Gamma\colon X \times Z \to P_{(L_q^n)^2}(2n(q^2+2))$ with the following
properties:

• $\Gamma$ is a concatenation of three paths I, A and II.

• I stays in $X$ and for all edges $((v,v'),(w,w'))$ in I both $(v,w)$ and $(v',w')$ consist of
single adjacent transpositions that preserve the order of $a$ and $b$ in each coordinate and
keep the rank of $a$ and $b$ fixed in coordinate $i$.

• II stays in $Z$ and for all edges $((v,v'),(w,w'))$ in II both $(v,w)$ and $(v',w')$ consist of
single adjacent transpositions that preserve the order of $c$ and $d$ in each coordinate and
keep the rank of $c$ and $d$ fixed in coordinate $j$.

• A consists of a single edge $((v,v'),(w,w'))$ such that $v, v', w, w'$ are all equal up to a
reordering of a block of elements $a, b, c, d$ in coordinates $i$ and $j$.

• For any $(v,v') \in (L_q^n)^2$ we have $|\Gamma^{-1}((v,v'))| \le 7nq^{12}(q!)^n$.

Proof. To define $\Gamma$ fix a starting pair $(x,x') \in X$ and an ending pair $(z,z') \in Z$. For this
pair, the paths I and II are both constructed as a concatenation of $n$ paths:

$$\mathrm{I} = \mathrm{I}(1), \dots, \mathrm{I}(n) \quad\text{and}\quad \mathrm{II} = \mathrm{II}(1), \dots, \mathrm{II}(n). \tag{41}$$

In order to define these paths first note that since I must stay in $X$, every vertex $(v,v')$ in
I must satisfy $v' = [a:b]_i v$. Thus it is enough to describe the projection of I to the first
coordinate of each pair. Let $\bar{\mathrm{I}}$ be this projection (so that if the j'th vertex of I is $(v,v')$, then
the j'th vertex of $\bar{\mathrm{I}}$ is $v$). Similarly since II must stay in $Z$, every vertex $(v,v')$ in II satisfies
$v' = [c:d]_j v$ and it is enough to describe $\overline{\mathrm{II}}$, the projection of II to the first coordinate of
each pair.

Now, for any path $\Gamma = (u(0), \dots, u(\ell)) \in P_{L_q^n}$ let $\Gamma_k = (u_k(0), \dots, u_k(\ell))$ denote its
restriction to coordinate $k$. The projections $\bar{\mathrm{I}}$ and $\overline{\mathrm{II}}$ can then be defined as follows.

• For any $k = 1, \dots, n-1$ the last vertex of $\bar{\mathrm{I}}(k)$ is equal to the first vertex of $\bar{\mathrm{I}}(k+1)$,
and the last vertex of $\overline{\mathrm{II}}(k)$ is equal to the first vertex of $\overline{\mathrm{II}}(k+1)$.

• $\forall k, m \ne k$: $\bar{\mathrm{I}}_m(k)$ and $\overline{\mathrm{II}}_m(k)$ are constant paths, i.e. $\bar{\mathrm{I}}(k)$ and $\overline{\mathrm{II}}(k)$ only change in
coordinate $k$.

• $\forall k \notin \{i,j\}$: $\bar{\mathrm{I}}_k(k)$ and $\overline{\mathrm{II}}_k(k)$ are the paths I and II making up $\Gamma(x_k, z_k)$ in
Proposition 8.1.

• $\bar{\mathrm{I}}_i(i)$ and $\overline{\mathrm{II}}_i(i)$ are the paths I and II making up $\Gamma(x_i, z_i)$ in Proposition 8.2.

• $\bar{\mathrm{I}}_j(j)$ and $\overline{\mathrm{II}}_j(j)$ are, respectively, the reverse of the paths II and I making up $\Gamma(z_j, x_j)$
in Proposition 8.2 with the role of $(a,b)$ there swapped with that of $(c,d)$.

Note that this uniquely determines A as the single edge from the last vertex of I to the first
vertex of II. The three statements about the edges of $\Gamma$ now follow from Propositions 8.1
and 8.2.

Finally, to compute $|\Gamma^{-1}((v,v'))|$ for $(v,v') \in (L_q^n)^2$ we need to count the number of
$(x,x') \in X$ and $(z,z') \in Z$ such that $(v,v')$ is a vertex on the path. Note that $|\Gamma^{-1}((v,v'))| = 0$
unless $(v,v') \in X$ or $(v,v') \in Z$. Without loss of generality assume that $(v,v') \in X$ (the
argument for $(v,v') \in Z$ is symmetric).

Then $v$ could belong to any of the $n$ paths $\bar{\mathrm{I}}(1), \dots, \bar{\mathrm{I}}(n)$. Suppose it belongs to $\bar{\mathrm{I}}(m)$. No
matter what $m$ is, $v$ can be any of at most $q^2 + 2q + 1$ vertices on the path $\bar{\mathrm{I}}(m)$. If $m \notin \{i,j\}$
then by Proposition 8.1 there can be at most $q^4\, q!$ possibilities for $(x_m, z_m)$, and if $m \in \{i,j\}$
then by Proposition 8.2 there can be at most $2q^3\, q! \le q^4\, q!$ possibilities for $(x_m, z_m)$. For all
other coordinates $k \ne m$ we have that $v_k$ equals either $x_k$ or the last vertex of $\bar{\mathrm{I}}_k(k)$. In both
cases there are by Proposition 8.2 at most $2q^3\, q!$ possibilities for $(x_k, z_k)$ if $k \in \{i,j\}$ and by
Proposition 8.1 exactly $q!$ possibilities for $(x_k, z_k)$ if $k \notin \{i,j\}$. Finally, since $(x,x') \in X$
and $(z,z') \in Z$ there is at most one possibility for $x'$ and $z'$ given $x$ and $z$. Hence we have,

$$|\Gamma^{-1}((v,v'))| \le n(q^2 + 2q + 1)\, q^4\, q!\, (2q^3\, q!)^2 (q!)^{n-3} \le 7nq^{12}(q!)^n \tag{42}$$

since $q \ge 4$.

□
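
The last inequality in (42) is where the assumption $q \ge 4$ enters, and this is easy to confirm numerically. The sketch below compares the two sides after dividing out the common factor $(q!)^n$; the function names are hypothetical and chosen for this illustration.

```python
def count_bound(n, q):
    """Middle expression of (42) divided by (q!)^n."""
    return n * (q * q + 2 * q + 1) * q ** 4 * (2 * q ** 3) ** 2

def target_bound(n, q):
    """Right-hand side of (42) divided by (q!)^n."""
    return 7 * n * q ** 12

# The comparison reduces to 4 (q + 1)^2 <= 7 q^2, which first holds at q = 4.
holds_from_four = all(count_bound(1, q) <= target_bound(1, q) for q in range(4, 200))
fails_at_three = count_bound(1, 3) > target_bound(1, 3)
```

The factor $n$ is common to both sides, so checking $n = 1$ suffices.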
8.1 Proof of Theorem 1.7

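Our main claim is the following.

Lemma 8.4. For any $f\colon L_q^n \to [q]$, distinct $i, j \in [n]$ and distinct $a, b, c, d \in [q]$ there exists
a mapping

$$h\colon B_i^{a,b;[a:b]}(f) \times B_j^{c,d;[c:d]}(f) \to M, \qquad M = \{x \in L_q^n \mid f \text{ is 4-manipulable at } x\},$$

such that for any $x \in M$

$$|h^{-1}(x)| \le 10^4\, n q^{16} (q!)^n. \tag{43}$$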

Proof. Fix $(x,x') \in B_i^{a,b;[a:b]}(f)$ and $(z,z') \in B_j^{c,d;[c:d]}(f)$. Then there exists a refined canonical
path $\Gamma = \Gamma((x,x'),(z,z'))$ (being a concatenation of three paths I, A and II) satisfying the
properties of Proposition 8.3. We now claim the following:

Claim: Somewhere on this path there will be a vertex $(v,v')$ such that $v$ is close to a
4-manipulation point $y$, in the sense that it differs from $y$ in at most 2 coordinates, and in
each of those two coordinates it only differs by a reordering of the elements $a, b, c$ and $d$ and
an arbitrary shifting of a single element in $[q]$.

We will take $h((x,x'),(z,z'))$ to be an arbitrary 4-manipulation point $y$ satisfying the
closeness requirement in the claim for some vertex on the path.

Now note that along this path at least one of the following three things must happen:

1. Somewhere along the first part I of the path there is an edge $((v,v'),(w,w'))$ such that
$(f(v),f(v')) = (a,b)$ but $(f(w),f(w')) \ne (a,b)$.

2. Somewhere along the second part II of the path there is an edge $((v,v'),(w,w'))$ such
that $(f(v),f(v')) \ne (c,d)$ but $(f(w),f(w')) = (c,d)$.

3. Let $((v,v'),(w,w'))$ be the single edge in A. Then $(f(v),f(v')) = (a,b)$ and $(f(w),f(w')) =
(c,d)$.

We argue that the claim follows in each of these cases:

1. If $e := f(w) \ne a$, Lemma 7.1 implies that $w = [a:e]_k v$ for some $k \in [n]$ (else $v$ or $w$ is
a 2-manipulation point, yielding the claim). Since the order of $a$ and $b$ is preserved in
all coordinates in I we must have $e \ne b$. Further $k \ne i$, since the rank of $a$ is preserved
in coordinate $i$ in this part of the path. Thus $(v,v') \in B_i^{a,b;T}$ and $(v,w) \in B_k^{a,e;T}$ and
Lemma 7.2 implies that there is a 3-manipulation point $y$ which only differs from $v, v', w$
and $w'$ in coordinates $i$ and $k$. Furthermore, $y_k$ is equal to $v_k$ or $w_k$ except that the
position of $b$ may have been shifted arbitrarily, and $y_i$ is equal to $v_i = w_i$ or $v'_i = w'_i$
except that the position of $e$ may have been shifted arbitrarily. Thus it is either close
to $v$ or $w$, in the sense of the claim.

The other possibility is that $e := f(w') \ne b$, for which the claim follows by an analogous
argument (remembering that $v$ and $v'$ only differ by an adjacent swap of $a, b$).

2. The claim again follows analogously to the previous case.

3. In this case Proposition 8.3 guarantees that $v, v', w, w'$ only differ by a reordering of
adjacent blocks of elements $a, b, c, d$ in coordinates $i$ and $j$. Thus we may define a
new social choice function $f'\colon L_{\{a,b,c,d\}}^2 \to \{a,b,c,d\}$ by letting $f'(u) = f(g(u))$ where
$g(u) \in L_q^n$ is obtained from $v$ by simply reordering the two blocks of elements $a, b, c, d$
in coordinates $i$ and $j$ so that they match $u_1$ and $u_2$ respectively. Note that this
reordering can be done using adjacent transpositions involving $a, b, c$ and $d$ only. Hence
by Lemma 7.1, $\forall u : f(g(u)) \in \{a,b,c,d\}$, or else one of the intermediate points under
this reordering using adjacent transpositions must be a 2-manipulation point, yielding
the claim.

So we may assume that $f'$ is well-defined, i.e. takes values in $\{a,b,c,d\}$. However
since $f'$ takes on all four values and is not a dictator, Gibbard-Satterthwaite
(Theorem 1.2) implies that $f'$ must have a manipulation point $u$ but then $g(u)$ must be a
4-manipulation point of $f$, proving the claim.

Now fix $y \in M$. In order to count $|h^{-1}(y)|$ note that there can be at most $(4!\,q^2)^2$ values
of $v$ satisfying the closeness requirement to $y$ given in the claim. Given $v$ there are only 2
possibilities for the vertex $(v,v')$ (depending on whether the vertex is in I or in II). Further,
by Proposition 8.3 there can be at most $7nq^{12}(q!)^n$ canonical paths containing any specific
vertex. Thus,

$$|h^{-1}(y)| \le 2(4!\,q^2)^2 \cdot 7nq^{12}(q!)^n \le 10^4\, n q^{16}(q!)^n. \tag{44}$$

□
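
The constant in (44) comes from a short computation, $2 \cdot (4!)^2 \cdot 7 = 8064 \le 10^4$, together with $(q^2)^2 q^{12} = q^{16}$. A minimal sketch of this bookkeeping follows; the variable names are hypothetical.

```python
from math import factorial

# Numerical constant: 2 choices of (v, v'), (4!)^2 block reorderings, factor 7 from (42).
constant = 2 * factorial(4) ** 2 * 7

# Exponent of q: (q^2)^2 from shifting one element in each of two coordinates, q^12 from (42).
q_exponent = 2 * 2 + 12

constant_fits = constant <= 10 ** 4
```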

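Proof of Theorem 1.7. By Corollary 7.4, either we are done or we can find distinct $i, j \in [n]$
and distinct $a, b, c, d \in [q]$ such that, by (29),

$$|B_i^{a,b;[a:b]}(f)| \ge \frac{2\varepsilon}{nq^7}(q!)^n \quad\text{and}\quad |B_j^{c,d;[c:d]}(f)| \ge \frac{2\varepsilon}{nq^7}(q!)^n. \tag{45}$$

Let $M = \{x \in L_q^n \mid f \text{ is 4-manipulable at } x\}$. Applying Lemma 8.4 we see that

$$|M| \ge \frac{|B_i^{a,b;[a:b]}(f) \times B_j^{c,d;[c:d]}(f)|}{10^4\, n q^{16}(q!)^n} \ge \frac{4\varepsilon^2}{10^4\, n^3 q^{30}}(q!)^n. \tag{46}$$

Hence,

$$P(f \text{ is 4-manipulable at } X) \ge \frac{\varepsilon^2}{10^4\, n^3 q^{30}}. \tag{47}$$

□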
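
The exponents in (46) and (47) can be traced with exact rational arithmetic. The sketch below is an illustration with hypothetical function names: it takes the boundary bound $2\varepsilon/(nq^7)$ from (45), squares it, divides by the congestion bound of Lemma 8.4, and compares the result with the bound stated in (47).

```python
from fractions import Fraction

def boundary_fraction(eps, n, q):
    """Lower bound on |B| / (q!)^n from (45)."""
    return 2 * eps / (n * q ** 7)

def manipulable_fraction(eps, n, q):
    """Lower bound on |M| / (q!)^n obtained as in (46)."""
    return boundary_fraction(eps, n, q) ** 2 / (10 ** 4 * n * q ** 16)

eps, n, q = Fraction(1, 10), 5, 4
from_46 = manipulable_fraction(eps, n, q)
stated_47 = eps ** 2 / (10 ** 4 * n ** 3 * q ** 30)
consistent = from_46 == 4 * stated_47
```

The squared boundary term contributes $n^{-2}q^{-14}$ and the congestion bound contributes $n^{-1}q^{-16}$, which together give the exponents $n^{-3}q^{-30}$.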



9 Open problems 

We list a few natural open problems that arise from our work. 

• In Corollary 1.8 we prove that a random pair $x, y$ is a manipulation point with
non-negligible probability, if $y$ is obtained from $x$ by a random change in 4 adjacent
alternatives, applied to a random coordinate. For the case where $y$ is obtained from $x$ by
simply re-randomizing one of the coordinates, which is the one considered in [FKN09],
we only have a lower bound where $q!$ appears in the denominator (see Corollary 1.6).
It would be interesting to prove a polynomial lower bound in the latter case.

• As is often the case with arguments involving canonical paths, we suspect that the 
parameters we obtained are not tight. It would be interesting to find the correct tight 
bounds. In particular, we are not even sure that the lower bound on the number of 
manipulation points must decrease with q — the correct bound may even increase as a 
function of q for neutral functions. 

• Our results, as well as those of [FKN09], apply only to neutral functions. Can one prove 
a quantitative Gibbard-Satterthwaite theorem for non-neutral functions? 

• It would also be interesting to consider the Gibbard-Satterthwaite theorem quantita- 
tively for non-uniform distributions over preferences. 